TTGTAACAGA AAATTAAAAT ATACTCCACT CAAGGGAATT CTGTACTTTG CCCTTTTGGT -9 9 
AAA GT C T C AT TTACATTTCT AAACCTTTCT TAAGAAAATC GAATTTCCTT TGATCTCTCT -3 9 
1 ■ " -1MTSCHI 

TCTGAATTGC AGAAATCAGA TAAAAACTAC TTGGTGAA ATG ACT TCT TGT CAC ATT 18 



7 


A 


E 


E 


H 


I 


Q 


K 


V 


A 


I 


F 


G 


G 


T 


H 


G 






GCT 


GAA 


GAA 


CAT 


ATA 


CAA 


AAG 


GTT 


GCT 


ATC 


TTT 


GGA 


GGA 


ACC" 


CAT 


GGG 


66 


23 


N 


E 


L 


T 


G 


V 


F 


Li 


V 


K 


H 


T.T 
W 


T 
Li 


E 


N 


G 






AAT 


GAG 


CTA 


ACC 


GGA 


GTA 


TTT 


CTG 


GTT 


AAG 


CAT 


TGG 


CTA 


GAG 


AAT 


GGC 


114 


39 


A 


E 


I 


Q 


.R 


T 


G 


L 


E 


V 


K 


P 


F 


I 


T 


N 






GCT 


GAG 


ATT 


CAG 


AGA 


ACA 


GGG 


CTG 


GAG 


GTA 


AAA 


CCA 


TTT 


ATT 


ACT 


AAC 


162 


55 


P 


R 


A 


V 


K 


K 


C 


T 


R 


XT 

Y 


I 


D 


C 


D 


L 


N 






CCC 


AGA 


GCA 


GTG 


AAG 


AAG 


TGT 


ACC 


AGA 


TAT 


ATT 


GAC 


TGT 


GAC 


CTG 


AAT 


210 


71 


R 


I 


F 


D 


L 


E 


N 


L 


G 


K 


K 


M 


S 


E 


D 


L 






CGC 


ATT 


TTT 


GAC 


CTT 


GAA 


AAT 


CTT 


GGC 


AAA 


AAA 


ATG 


TCA 


GAA 


GAT 


TTG 


258 


87 


P 


Y 


E 


V 


R 


R 


- A 


Q 


E 


I 


N 


H 


L 


F 


G 


P 




o 


CCA 


TAT 


GAA 


GTG 


AGA 


AGG 


GCT 


CAA 


GAA 


ATA 


AAT 


CAT 


TTA 


TTT 


GGT 


CCA 


306 




K 


D 


S 


E 


D 


S 


Y 


D 


I 


I 


F 


D 


L 


H 


* 

N 


T 






AAA 


GAC 


AGT 


GAA 


GAT 


TCC 


m tv m 

TAT 


GAC 


tv mm 

ATT 


ATT 


TTT 


GAC 


CTT 


CAC 


AAC 


ACC 


3 54 


l£i 


T 


S 


N 


M 


G 


C 


T 


L 


I 


L 


E 


D 


S 


R 


N 


N 




m 


ACC 


TCT 


AAC 


ATG 


GGG 


TGC 


ACT 


CTT 


ATT 


CTT 


GAG 


GAT 


TCC 


AGG 


AAT 


AAC 


402 




F 


L 


I 


Q 


M 


F 


H 


Y 


I 


K 


T 


S 


L 


A 


P 


L 






TTT 


TTA 


ATT 


CAG 


ATG 


TTT 


CAT 


TAC 


ATT 


AAG 


ACT 


TCT 


CTG 


GCT 


CCA 


CTA 


450 


15T: 


P 


C 


Y 


V 


Y 


L 


I 


E 


H 


P 


s 


L 


K 


Y 


A 


T 






CCC 


TGC 


TAC 


GTT 


TAT 


CTG 


ATT 


GAG 


CAT 


CCT 


TCC 


CTC 


AAA 


TAT 


GCG 


ACC 


498 


167 


T 


R 


S 


I 


A 


K 


Y 


P 


V 


G 


I 


E 


V 


G 


P 


Q 




'ass. 


ACT 


CGT 


TCC 


ATA 


GCC 


AAG 


TAT 


CCT 


GTG 


GGT 


ATA 


GAA 


GTT 


GGT 


CCT 


CAG 


546 




P 


Q 


G 


V 


L 


R 


A 


D 


I 


L 


D 


Q 


M 


R 


K 


M 






CCT 


CAA 


GGG 


GTT 


CTG 


AGA 


GCT 


GAT 


ATC 


TTG 


GAT 


CAA 


ATG 


AGA 


AAA 


ATG 


594 


l^S 


I 


K 


H 


A 


L 


D 


F 


I 


H 


H 


F 


N 


E 


G 


K 


E 






ATT 


AAA 


CAT 


GCT 


CTT 


GAT 


TTT 


ATA 


CAT 


CAT 


TTC 


AAT 


GAA 


GGA 


AAA 


GAA 


642 


2 jrS 


F 


P 


P 


C 


A 


I 


E. 


V 


Y 


K 


I 


I 


E 


K 


V 


D 






TTT 


CCT 


CCC 


TGC 


GCC 


ATT 


GAG 


GTC 


TAT 


AAA 


ATT 


ATA 


GAG 


AAA 


GTT 


GAT 


690 


2 31 


Y 


P 


R 


D 


E 


N 


G 


E 


I 


A 


A 


I 


I 


H 


P 


N 






TAC 


CCC 


CGG 


GAT 


GAA 


AAT 


GGA 


GAA 


ATT 


GCT 


GCT 


ATC 


ATC 


CAT 


CCT 


AAT 


738 


247 


L 


Q 


D 


Q 


D 


W 


K 


P 


L 


H 


P 


G 


D 


P 


M 


F 






CTG 


CAG 


GAT 


CAA 


GAC 


TGG 


AAA 


CCA 


CTG 


CAT 


CCT 


GGG 


GAT 


CCC 


ATG 


TTT 


786 


263 


L 


T 


L 


D 


G 


K 


T 


I 


P 


L 


G 


G 


D 


C 


T 


V 






TTA 


ACT 


CTT 


GAT 


GGG 


AAG 


ACG 


ATC 


CCA 


CTG 


GGC 


GGA 


GAC 


TGT 


ACC 


GTG 


834 


279 


Y 


P 


V 


F 


V 


N 


E 


A 


A 


Y 


Y 


E 


K 


K 


E 


A 






TAC 


CCC 


GTG 


TTT 


GTG 


AAT 


GAG 


GCC 


GCA 


TAT 


TAC 


GAA 


AAG 


AAA 


GAA 


GCT 


882 


295 


F 


A ! 


K 


T 


T 


K 


L 


T 


L 


N 


A 


K 


S 


I 


R 


C 






TTT 


GCA 


AAG 


ACA 


ACT 


AAA 


CTA 


ACG 


CTC 


AAT 


GCA 


AAA 


AGT 


ATT 


CGC 


TGC 


930 


311 


C 


L 


H 































TGT TTA CAT TAG AA ATCACTTCCA GCTTACATCT TACACGGTGT CTTACAAATT 9 84 
CTGCTAGTCT GTAAGCTCCT TAAGAGTAGG GTTGTGCCTT ATTCAACTGC ATACATAGCT 104 4 
C CT AG CAC AG TGCCTTATTC GGTAGGCATC TAAGCAAATT TCTTAAATTA ATTAATATAT 1104 
CTTTAAAGAT AT C AT ATTTT ATGTATGTAG CTTATTCAAA GAAGTGTTTC CTATTTCTAT 116 4 



ATAGTTTATT ATACATGATA CTTGGGTAGC TCAACATTCT TAATAAACAG C CTTTGT ATT 12 3 4 
CAGAAj|pC|K^TTGAAATA GATATATATA AAGTTAAAAA i^AAiXAAAA AAA 12 8 7 



Fig. 1 



lOv 20v 30v 40v 50v 

HLASP MTSCHIAEEHIQKVAIFC^THGNELTGVFLVKHWLENGAEIQRTGLEVKPF 

MTSCH : AE : . I : KVAIFGGTHGNELTGVFLVKHWLEN : : EIQRTGLEVKPF 

BASPCDNA MTSCHVAEDPIKKVAIFG^ 

10" 3Q ^ 4Q ^ 5Q ^ 

60v 70V 80V 90v lOOv 

HLASP ITNPRAVKKCTRYIDCDLNRIFDLENLGKKKJSEDLPYEVRRAQEINHLFGP 
ITNPRAVKKCTRYIDCDLNR: FD ENLGKK. SEDLPYEVRRAQEINHLFGP 
BASPCDNA ITNPRAVKKCTRYIDCDLNRVFDPENLGKKKSEDLPYEVRRAQEINHLFGP 

60" 70" 80" 90" 100" 

110V 120v 130v 140V 150V 

HLASP KE^ElSSYDIIFDLHN*TTSNMGCTLILEDSRNNFLIQMFHYIKTSLAPLPCY 
O KDSEDSYDIIFDLHN*TTSNMGCTLILEDSRN: FLIQMFHYIKTSLAPLPCY 

BASPCDNA KDSECaSYDIIFDLHN^TTSNMGCTLILEDSRNDFLIQMFHYIKTSLAPLPCY 
ig 110" 120" 130" 140" 150" 

ff\ 160v 170v 180v 190v 200v 

Hjy&SP VYLIEHPSLKYATTRSIAKYPVGIEVGPQPQGVLRADILDQMRKMIKHALD 
fQ VYLI EHP SLKYATTRS I AK YP VG I EVGPQPQGVLRAD I LDQMRKM I : HALD 

B$gPCDNA VYLIEHPSLKYATTRSIAKYPVGIEVGPQPQGVLRADILDQMRKMIQHALD 
160" 170" 180" 190" 200~ 

210v 220v 230v 240v 250v 

H^ASP FIHHFNEGKEFPPCAIEVYKIIEKVDYPRDENGEIAAIIHPNLQDQDWKPL 
U FIH : FNEGKEFPPCAIEVYKI : KVDYPR : E : GEI : AIIHP : LQDQDWKPL 

B^gSPCDNA FIHNFNEGKEFPPCAIEVYKIMRKVDYPRNESGEISAIIHPKLQDQDWKPL 
J~f 210" 220" 230" 240" 250" 

^ 260v 270v 280v 290v 300v 

hMsp hpgdpmfi^ldgktiplggdSI^ypvf^'S 

"p* HP . DP : FLTLDGKTIPLGGD i^YPVF^im^YEKKEAFAKTTKLTLNA : 

BASPCDNA HPEDPVFL^LDGKTIPLGGi^t^ 

260" 270" #0" . ^'^$g : $~ 300^ 

310v 

HLASP SIRCCLH 

SIR, • LH 
BASPCDNA SIRSSLH 

310" 



Fig. 2 




Fig. 3 
Kaul et al. 
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Tig. 4 

Kaul et al, 




Fig. 5 



Kaul 



et al . 




Fig. 6 
Kaul et al . 



MAPSEQ V5.33 .HASP . SEQ ( 1 , 1277 ) Reading frames : I Enzyme fiie' ALL.ENZ LinPage 1 



(*4t 

*4 



EAM 


M 


E - 


M 


N 


DSNDSBB 


ASA 


B 


C 


N 


L 


STCSESS 


MPE 


O 


5 


L 


' A 


AYOACAA 


1E3 


2 


7 


1 


4 


11111JJ 



/ ///// 

ATGACTTCTTGTCACATTGCTGAAGAACATATACAAAAGGTTGCTATCTTTGGAGGAACC 

* + + . + . + . + . h 6 0 

TACTGAAGAACAGTGTAACGACTTCTTGTATATGTTTTCCAACGATAGAAACCTCCTTGG 



m t 



c h i . a 



e h 



k v a 



f g g t 



N 
L 
A 
3 



A 
L 
U 
1 



BBH 
SCP 
AAA 
W7 2 
// 



j 



TSM 
RPS 
UOE 
911 
/ 



RM 
MA 
AE 
11 
/ 



* + 



H HHD TH 

I HAD FN 

N AEE IF 

P 121 11 

/ / 



CATGGGAATGAGCTAACCGGAGTATTTCTGGT^AAGCATTGGCTAGAGAATGGCGCTGAG' 

. + , s + . + . + . + . + 12 0 

GTACCCTTACTCGATTGGCCTCATAAAGACCAATTCGTAACCGATCTCTTACCGCGACTC 



h g n 



1 t 



g v 



v k h w 1 



e n g a 



-+ 



M 
N 
L 
1 



B 
P 
M 
1 



ATTCAGAGAACAGGGCTGGAGGTAAAACCATTTATTACTAACCCCAGAGCAGTGAAGAAG 

+ . + . + . + . + . + 180 

TAAGTCTCTTGTGCCGACCTCCATTTTGGTAAATAATGATTGGGGTCTCGTCACTTCTTC 
iqrtglevk pfitnpravkk 



-+- 



-+ 




' us 



/ 



( 



J^SEQ V5.33 HASP . SEQ ( 1 , 127TT Reading frames: l Enzyme file ALL - EN Z LinPage 2 

CR • M ' M TH 

SS B A FN 

PA O E - IF - 

612 3 11.' 

/ 

TGTACCAGATATATTGACTGTGACCTGAATCGCATTTTTGACCTTGAAAATCTTGGCAAA 

. + . + . + . . + . + 24 0 

ACATGGTCTATATAACTGACACTGGACTTAGCGTAAAAACTGGAACTTTTAGAACCGTTT 

ctryidcdln rifdlenl gk 
. + . + . + . + . + . + 



NM BN 
DB AS 
EO NP 
12 22 

/ 

AAAATGTCAGAAGATTTGCCATATGAAGTGAGAAGGGCTCAAGAAATAAATCATTTATTT 

. + . + . + . + . + — - — . + 3 00 

TTTTACAGTCTTCTAAACGGTATACTTCACTCTTCCCGAGTTCTTTATTTAGTAAATAAA 

kmsedlpyevrraqeinhlf 



-+- 



A TH M S 

V FN B P 

A IF O O 

2 11 2 1 

/ 

GGTCCAAAAGACAGTGAAGATTCCTATGACATTATTTTTGACCTTCACAACACCACCTCT 
. + . + . + „ + . + . + 3 60 

CCAGGTTTTCTGTCACTTCTAAGGATACTGTAATAAAAACTGGAAGTGTTGTGGTGGAGA 
gp k dsedsydiifdlhntts 



.7 ft") 



?TSEQ V5.33 HASP. SEQ (1, 12T7) Reading frames: l Enzyme file ALL.ENZ LinPage 3 



MN A 
NL P 
LA L 

13". 1 
/ 



HBN M 
GSS N 
IIP L 

AH2 1 

// 



TH E AS 

FN .C PC 

IF R YR 

II 2 11 

/ / 



TM 
RS 
UE 

■91 

it 



/ 



AACATGGGGTGCACTCTTATTCTTGAGGATTCCAGGAATAACTTTTTAATTCAGATGTTT 

. 4- . + . -+ . + . 4- . + 420 

TTGTACCCCACGTGAGAATAAGAACTCCTAAGGTCCTTATTGAAAAATTAAGTCTACAAA 



n m 



n 



v 
TM 

RS 

UE 

91 



N 
L 
A 
4 



M F 

A O 

E K 

2 1 



E 
C 
O 
B 



CATTACATTAAGACTTCTCTGGCTCCACTACCCTGCTACGTTTATCTGATTGAGCATCCT 
. + m +. . + . + . + . + 

GTAATGTAATTCTGAAGAGACCGAGGTGATGGGACGATGCAAATAGACTAACTCGTAGGA 



480 



- + 



S 
F 
A 
N 



M 

N 
L 
.1 



A 
V 
A 
2 



TtCCTCAAATATGCGACCACTCGTTCCATAGCCAAGTATCCTGTGGGTATAGAAGTTGGT 



ACfGGAGTTTATACGCTGGTGAGCAAGGTATCGGTTCATAGGACACCCATATCTTCAACCA 
s'jlkyjattrsiakypvg ievg 



540 



^0SEQ V5.3 3 HASP . SEQ ( 1 , 12 77 ) Reading frames: l 



D 


M 


M 


D 


A 


E 


BMDD 


D '. 


N 


N 


D 


L 


C 


IBPP 


E - 


. L. 


L 


E- . 


. U 


R 


NONN 


1 


* 1 


1 


1 


1 


V 


1121 



/ 



Enzyme file"' ALL. ENZ LinPage 4 



RS 
UE 
91 
7 



CCTCAGCCTCAAGGGGTTCTGAGAGCTGATATCTTGGATCAAATGAGAAAAATGATTAAA 

. + . + . + . + . + . 4- 

GGAGTCGGAGTTCCCCAAGACTCTCGACTATAGAACCTAGTTTACTCTTTTTACTAATTT 



600 



q g 



m 



m 



NN 
SL 
PA 
H3 
/ 



HMHM 
INHN 
NLAL 
Pill 
/ 



CATGCTCTTGATTTTATACATCATTTCAATGAAGGAAAAGAATTTCCTCCCTGCGCCATT 

. + # + . + . + . + . + 

GTACGAGAACTAAAATATGTAGTAAAGTTACTTCCTTTTCTTAAAGGAGGGACGCGGTAA 



660 



n 



E 
C 
P 
1 



g k e f p 


P c 


a 


i 










BSBNXSASSBBHNSB 


FF 


F 


IF 


SESCMMVCESSPCCB 


00 


0 


TN 


ACAIAAARCAAAIRV 


KK 


K 


AU 


J1J111111JJ2111 


11 


1 


1H 


/////////// 


/ 




/ 



GAGGTCTATAAAATTATAGAGAAAGTTGATTACCCCCGGGATGAAAATGGAGAAATTGCT 
. + . + . + . + . + . + 

CTCCAGATATTTTAATATCTCTTTCAACTAATGGGGGCCCTACTTTTACCTCTTTAACGA 



720 



n 



YZ3i? X 



&SEQ VS.* 3-3 HASP.SEQ(1, 127^^;eading frames: i Enzyme 



m 



s 

F 
C 
1 



PBMDD 
SIBPP 
TNONN 
11121 
//// 



F 
O 
K 
1 



e ALL.ENZ LinPage . 5 



ESASBBSBXBNMDDB 
CEPCSSFIHALBPPI 
RCYRAAANOMAONNN 
2111JJN12141211 

/// III iill I 



GCTATCATCCATCCTAATCTGCAGGATCAAGACTGGAAACCACTGCATCCTGGGGATCCC 

m h . + . + . + . H : - . + 

CGATAGTAGGTAGGATTAGACGTCCTAGTTCTGACCTTTGGTGACGTAGGACCCCTAGGG 



780 



n 



w 



N 
L 
A 
3 



TM 
RS 
UE 
91 
/ 



B MD^BBDBMA t BBAB CR CR 

B BPBSPPIBL SSCS SS SS 

V ONSCUNNOW ILIM PA PA 

2 121911122 Y112 61 61 

/ //// /// 



pj ATGTTTTTMCTCTTGATGGGAAGACGATCCCACTGGGjpGGAGACTGTACCGTGTACCCC 



. 1 . -I . 1 . 1 — 1 . 1 . h 

TACAAAAATTGAGAACTACCCTTCTGCTAGGGTGACCaGCCTCTGACATGGCACATGGGG 
mf ltldgktip lggdctvyp 



840 



-+- 



SM HIFA H A 

PN ATNC I L 

OL EAUI N U 

11 31H1 3 1 

/ * // Y- 

GTGTTTGTGAATGAGGCCGCATATTACGAAAAGAAAGAAGCTTTTGCAAAGACAACTAAA 



CACAAACACTTACTCCGGCGTATAATGCTTTTCTTTCTTCGAAAACGTTTCTGTTGATTT 
vfvneaayyekkea fak ttk 



900 



vSEQ.V5.33- HASP.SEQ(1, 1^^) Reading frames: 1 Enzym^ile' ALL.ENZ I>inPage^ 6 



S B H-IF - E A 

P B. NTN C L. 

0 V. FAU 1. ■ • U ■•; 

1 1 . 31H ' 5 . . . 1 

. ' * / 

CTAACGCTCAATGCAAAAAGTATTCGCTGCTGTTTACATTAGAAATCACTTCCAGCTTAC 

, + . + . + . + . + . + 

GATTGCGAGTTACGTTTTTCATAAGCGACGACAAATGTAATCTTTAGTGAAGGTCGAATG 



960 



n 



RM 
MA 
AE 
11 
/ 



A 
L 
U 
1 



ATM 
FRS 
LUE 
291 
/ 



ATCTTACACGGTGTCTTACAAATTCTGCTAGTCTGTAAGCTCCTTAAGAGTAGGGTTGTG 

. + . 4* . + . -f . 4- . + 

TAGAATGTGCCACAGAATGTTTAAGACGATCAGACATTCGAGGAATTCTCATCCCAACAC 



1020 



-f 



B 


A 


RM 


H 


D 


S 


S 


L 


MA 


N 


D 


F 


P 


U 


AE 


F 


E 


A 


W 


1 


11 
/ 


3 


1 


N 



CCTTATTCAACTGCATACATAGCTCCTAGCACAGTGCCTTATTCGGTAGGCATCTAAGCA 

. + . : -+ . + . -f . + . + 108 0 

GGAATAAGTTGACGTATGTATCGAGGATCGTGTCACGGAATAAGCCATCCGTAGATTCGT 

pys tayiapstvpysvgi . a 



■+ • +- . — — +— — . + . + . + 



i/SEQ V5.33 HASP. SEQ(1, 1277) Reading frames: 1 Enzyme file ALL.ENZ LinPage 7 

ThT ATM PATM TDM E A 

RS SRS ASRS RRS C L 

UE EUE CEUE UAE R U 

91 191 1191 911 V 1 

/ // /// / 

AATTTCTTAAATTAATTAATATATCTTTAAAGATATCATATTTTATGTATGTAGCTTATT 

. 4- . + . + . + . + . + 114 0 

TTAAAGAATTTAATTAATTATATAGAAATTTCTATAGTATAAAATACATACATCGAATAA 

nfln. 1 i y 1 . . ryhilcm. li 



X N A 

M L L 

N A U 

1 3 1 

CAAAGAAGTGTTTCCTATTTCTATATAGTTTATTATACATGATACTTGGGTAGCTCAACA 

. + . + . + . + . + . + 12 00 

GTTTCTTCACAAAGGATAAAGATATATCAAATAATATGTACTATGAACCCATCGAGTTGT 



. + . + . + . + . + . -f 

TM TM 

RS RS 

UE UE 

91 91 

/ / 
TTCTTAATAAACAGCCTTTGTATTCAGAATATAAAATTGAAATAGATATATATAAAGTTA 

. + . + . + . + . + . + 12 6 0 

AAGAATTATTTGTCGGAAACATAAGTCTTATATTTTAACTTTATCTATATATATTTCAAT 

flinslciqniklk. iyikl 

, + . + m + a : + ^ + # + 



AAAAAAAAAAAAAAAAA 

, + — 1277 

TTTTTTTTTTTTTTTTT 

k k k k k k 



-+- 



H 3 . 



